support for explicit test_dataset definition for evals #786
Conversation
Hmm, our data pipeline is getting a bit complex. I had a bit of a hard time following the flow.
Force-pushed from 8f11779 to 3bcdab4
Adding this config to the README would be helpful.
Force-pushed from feed723 to fb72cb5
This will address #875.
I think we'll need to add documentation about this, and also consider whether it would be appropriate to hardcode the …
@NanoCode012 What do you mean by hardcoding?
Force-pushed from fb72cb5 to 4e5da2a
Do you have an example or documentation of how this can be used?
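In case it helps while docs are pending: a minimal sketch of what such a config might look like, assuming `test_datasets` mirrors the existing `datasets` schema. The `path`, `ds_type`, and `type` values are placeholders, and the `val_set_size: 0` line reflects my assumption that an explicit test set replaces the automatic train/val split rather than combining with it.

```yaml
# Hypothetical config sketch -- field names assume the same schema as `datasets`.
test_datasets:
  - path: data/eval.jsonl   # placeholder local JSONL file
    ds_type: json
    split: train            # local JSON loads typically expose a single "train" split
    type: completion

# Assumption: with an explicit test set, the automatic split is disabled.
val_set_size: 0
```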
I was trying to reverse engineer how it's supposed to work, and maybe there's a bug here:

```python
dataset, prompters = load_tokenized_prepared_datasets(
    tokenizer, cfg, default_dataset_prepared_path
)
```

Shouldn't you pass …
First pass at supporting different datasets for evals rather than splitting the test dataset.
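That description implies a branch in the data pipeline; here is a rough sketch of the control flow it suggests, not the PR's actual code. `load_tokenized_prepared_datasets` is the loader quoted earlier in the thread, while `prepare_train_and_eval`, `load_explicit_test_datasets`, and the `cfg.test_datasets` field are hypothetical stand-ins for whatever the implementation actually does.

```python
# Illustrative sketch only -- not the PR's implementation.
from axolotl.utils.data import load_tokenized_prepared_datasets  # loader quoted above


def prepare_train_and_eval(tokenizer, cfg, default_dataset_prepared_path):
    # The training set is always built from cfg.datasets.
    train_dataset, prompters = load_tokenized_prepared_datasets(
        tokenizer, cfg, default_dataset_prepared_path
    )

    if getattr(cfg, "test_datasets", None):
        # Explicit eval data: tokenize cfg.test_datasets through the same
        # pipeline instead of carving a slice out of the training data.
        # (Hypothetical helper -- stands in for the real loading path.)
        eval_dataset = load_explicit_test_datasets(
            tokenizer, cfg, default_dataset_prepared_path
        )
    else:
        # Old behavior: reserve a val_set_size fraction of the train set.
        split = train_dataset.train_test_split(test_size=cfg.val_set_size)
        train_dataset, eval_dataset = split["train"], split["test"]

    return train_dataset, eval_dataset, prompters
```

Routing the explicit test set through the same tokenization path would keep train and eval preprocessing consistent, which seems to be the concern behind the earlier comment about the pipeline getting complex.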